Adaptive topological tree structure for document organisation and visualisation

نویسندگان

  • Richard T. Freeman
  • Hujun Yin
چکیده

The self-organising map (SOM) is finding more and more applications in a wide range of fields, such as clustering, pattern recognition and visualisation. It has also been employed in knowledge management and information retrieval. We propose an alternative to existing 2-dimensional SOM based methods for document analysis. The method, termed Adaptive Topological Tree Structure (ATTS), generates a taxonomy of underlying topics from a set of unclassified, unstructured documents. The ATTS consists of a hierarchy of adaptive self-organising chains, each of which is validated independently using a proposed entropy-based Bayesian information criterion. A node meeting the expansion criterion spans a child chain, with reduced vocabulary and increased specialisation. The ATTS creates a topological tree of topics, which can be browsed like a content hierarchy and reflects the connections between related topics at each level. A review is also given on the existing neural network based methods for document clustering and organisation. Experimental results on real-world datasets using the proposed ATTS method are presented and compared with other approaches. The results demonstrate the advantages of the proposed validation criteria and the efficiency of the ATTS approach for document organisation, visualisation and search. It shows that the proposed methods not only improve the clustering results but also boost the retrieval.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using icicle trees to encode the hierarchical structure of source code

This paper presents a study which evaluates the use of a tree visualisation (icicle tree) to encode the hierarchical structure of source code. The tree visualisation was combined with a source code editor in order to function as a compact overview to facilitate the process of comprehending the global structure of a source code document. Results from our study show that providing an overview vis...

متن کامل

Knowledge Translation in Healthcare – Towards Understanding its True Complexities; Comment on “Using Complexity and Network Concepts to Inform Healthcare Knowledge Translation”

This commentary argues that to fully appreciate the complexities of knowledge transfer one firstly has to distinguish between the notions of “data, information, knowledge and wisdom,” and that the latter two are highly context sensitive. In particular one has to understand knowledge as being personal rather than objective, and hence there is no form of knowledge that a-priori is more authoritat...

متن کامل

Self-optimised Tree Overlays using Proximity-driven Self-organised Agents

Hierarchical structures are often deployed in large scale distributed systems to structure communication. Building and maintaining such structures in dynamic environments is challenging. Self-organisation is the approach taken in this chapter. AETOS, the Adaptive Epidemic Tree Overlay Service, provides tree overlays on demand. AETOS uses three local agents to this purpose (i) to translate appli...

متن کامل

Application of Graph Theory: Relationship of Topological Indices with the Partition Coefficient (logP) of the Monocarboxylic Acids

It is well known that the chemical behavior of a compound is dependent upon the structure of itsmolecules. Quantitative structure – activity relationship (QSAR) studies and quantitative structure –property relationship (QSPR) studies are active areas of chemical research that focus on the nature ofthis dependency. Topological indices are the numerical value associated with chemical constitution...

متن کامل

Adaptive Fuzzy Decision Tree with Dynamic Structure for Automatic Process Control System of Continuous Cast Billet Production

The article outlines the results of adaptive fuzzy decision tree development with dynamic structure for automatic process control system of continuous cast billet production. The authors described the structure, mathematical representation as well as adaptation algorithms and structure dynamics.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Neural networks : the official journal of the International Neural Network Society

دوره 17 8-9  شماره 

صفحات  -

تاریخ انتشار 2004